Fix the bug in canonicalize-live-in pass by ShangkunLi · Pull Request #117 · coredac/dataflow

ShangkunLi · 2025-08-20T12:46:45Z

The previous --canonicalize-live-in pass traversed the blocks in topological order. That is not sufficient for a case like:

module {
  func.func @_Z10bert_node1PA1_A1_A1_A1_A128_bPA1_A128_S1_(%arg0: memref<?x1x1x1x1x128xi8>, %arg1: memref<?x1x128x1x1x128xi8>) attributes {accelerator = "neura", llvm.linkage = #llvm.linkage<external>} {
    %0 = "neura.constant"() <{value = 1 : index}> : () -> index
    %1 = "neura.constant"() <{value = 128 : index}> : () -> index
    %2 = "neura.constant"() <{value = 0 : index}> : () -> index
    %3 = "neura.cast"(%2) <{cast_type = "index_to_int"}> : (index) -> i64
    neura.br %3 : i64 to ^bb1
  ^bb1(%4: i64):  // 2 preds: ^bb0, ^bb5
    %5 = "neura.cast"(%4) <{cast_type = "int_to_index"}> : (i64) -> index
    %6 = "neura.icmp"(%5, %1) <{cmpType = "slt"}> : (index, index) -> i1
    neura.cond_br %6 : i1 then to ^bb2 else to ^bb6
  ^bb2:  // pred: ^bb1
    %7 = "neura.cast"(%2) <{cast_type = "index_to_int"}> : (index) -> i64
    neura.br %7 : i64 to ^bb3
  ^bb3(%8: i64):  // 2 preds: ^bb2, ^bb4
    %9 = "neura.cast"(%8) <{cast_type = "int_to_index"}> : (i64) -> index
    %10 = "neura.icmp"(%9, %1) <{cmpType = "slt"}> : (index, index) -> i1
    neura.cond_br %10 : i1 then to ^bb4 else to ^bb5
  ^bb4:  // pred: ^bb3
    %11 = neura.load_indexed %arg0[%2, %2, %2, %2, %2, %9 : index, index, index, index, index, index] memref<?x1x1x1x1x128xi8> : i8
    neura.store_indexed %11 to %arg1[%2, %2, %5, %2, %2, %9 : index, index, index, index, index, index] memref<?x1x128x1x1x128xi8> : i8
    %12 = "neura.add"(%9, %0) : (index, index) -> index
    %13 = "neura.cast"(%12) <{cast_type = "index_to_int"}> : (index) -> i64
    neura.br %13 : i64 to ^bb3
  ^bb5:  // pred: ^bb3
    %14 = "neura.add"(%5, %0) : (index, index) -> index
    %15 = "neura.cast"(%14) <{cast_type = "index_to_int"}> : (index) -> i64
    neura.br %15 : i64 to ^bb1
  ^bb6:  // pred: ^bb1
    "neura.return"() : () -> ()
  }
}

Here, when we identify %2 in bb2 as a live-in, we wrap it in the block arguments of bb2 and update the corresponding operands of neura.cond_br in bb1. %2 is now also an operand of that neura.cond_br and is therefore a live-in of bb1. However, bb1 has already been visited, so the current implementation fails to update the block arguments of bb1.

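Thus the current implementation produces the following IR, in which bb1 passes %4 (the constant 0) to bb2 without receiving it as a block argument: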

  func.func @_Z10bert_node1PA1_A1_A1_A1_A128_bPA1_A128_S1_(%arg0: memref<?x1x1x1x1x128xi8>, %arg1: memref<?x1x128x1x1x128xi8>) attributes {accelerator = "neura", llvm.linkage = #llvm.linkage<external>} {
    %0 = "neura.constant"() <{predicate = true, value = "%arg0"}> : () -> memref<?x1x1x1x1x128xi8>
    %1 = "neura.constant"() <{predicate = true, value = "%arg1"}> : () -> memref<?x1x128x1x1x128xi8>
    %2 = "neura.constant"() <{value = 1 : i64}> : () -> i64
    %3 = "neura.constant"() <{value = 128 : i64}> : () -> i64
    %4 = "neura.constant"() <{value = 0 : i64}> : () -> i64
    neura.br %4, %3 : i64, i64 to ^bb1
  ^bb1(%5: i64, %6: i64):  // 2 preds: ^bb0, ^bb5
    %7 = "neura.icmp"(%5, %6) <{cmpType = "slt"}> : (i64, i64) -> i1
    neura.cond_br %7 : i1 then %4 : i64 to ^bb2 else to ^bb6
  ^bb2(%8: i64):  // pred: ^bb1
    neura.br %8, %3 : i64, i64 to ^bb3
  ^bb3(%9: i64, %10: i64):  // 2 preds: ^bb2, ^bb4
    %11 = "neura.icmp"(%9, %10) <{cmpType = "slt"}> : (i64, i64) -> i1
    neura.cond_br %11 : i1 then %0, %4, %9, %1, %5, %2, %3 : memref<?x1x1x1x1x128xi8>, i64, i64, memref<?x1x128x1x1x128xi8>, i64, i64, i64 to ^bb4 else %5, %2, %3 : i64, i64, i64 to ^bb5
  ^bb4(%12: memref<?x1x1x1x1x128xi8>, %13: i64, %14: i64, %15: memref<?x1x128x1x1x128xi8>, %16: i64, %17: i64, %18: i64):  // pred: ^bb3
    %19 = neura.load_indexed %12[%13, %13, %13, %13, %13, %14 : i64, i64, i64, i64, i64, i64] memref<?x1x1x1x1x128xi8> : i8
    neura.store_indexed %19 to %15[%13, %13, %16, %13, %13, %14 : i64, i64, i64, i64, i64, i64] memref<?x1x128x1x1x128xi8> : i8
    %20 = "neura.add"(%14, %17) : (i64, i64) -> i64
    neura.br %20, %18 : i64, i64 to ^bb3
  ^bb5(%21: i64, %22: i64, %23: i64):  // pred: ^bb3
    %24 = "neura.add"(%21, %22) : (i64, i64) -> i64
    neura.br %24, %23 : i64, i64 to ^bb1
  ^bb6:  // pred: ^bb1
    "neura.return"() : () -> ()
  }
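The failure described above can be reproduced in a small Python model of the input CFG. This is only a sketch: the CFG table is transcribed by hand from the first listing, and single_pass is a hypothetical stand-in for a one-visit-per-block traversal, not the code of the actual pass.

```python
# Hand-transcribed model of the input IR: for each block, the values it
# defines (results and original block arguments), the values it reads,
# and its successors. This table is an illustration, not pass output.
CFG = {
    "bb0": {"defs": {"%arg0", "%arg1", "%0", "%1", "%2", "%3"},
            "uses": {"%2", "%3"}, "succs": ["bb1"]},
    "bb1": {"defs": {"%4", "%5", "%6"},
            "uses": {"%4", "%1", "%5", "%6"}, "succs": ["bb2", "bb6"]},
    "bb2": {"defs": {"%7"}, "uses": {"%2", "%7"}, "succs": ["bb3"]},
    "bb3": {"defs": {"%8", "%9", "%10"},
            "uses": {"%8", "%1", "%9", "%10"}, "succs": ["bb4", "bb5"]},
    "bb4": {"defs": {"%11", "%12", "%13"},
            "uses": {"%arg0", "%arg1", "%0", "%2", "%5", "%9",
                     "%11", "%12", "%13"}, "succs": ["bb3"]},
    "bb5": {"defs": {"%14", "%15"},
            "uses": {"%0", "%5", "%14", "%15"}, "succs": ["bb1"]},
    "bb6": {"defs": set(), "uses": set(), "succs": []},
}


def single_pass(cfg, order):
    """Visit every block once; hypothetical model of the old behaviour."""
    preds = {b: [] for b in cfg}
    for b, info in cfg.items():
        for s in info["succs"]:
            preds[s].append(b)

    uses = {b: set(info["uses"]) for b, info in cfg.items()}
    args = {b: set() for b in cfg}
    for b in order[1:]:  # the entry block defines its own values
        live_in = uses[b] - cfg[b]["defs"]
        args[b] = live_in  # wrap the live-ins as new block arguments
        for p in preds[b]:
            uses[p] |= live_in  # the predecessor's branch now forwards them

    # Values a block reads that it neither defines nor receives as argument.
    missing = {b: uses[b] - cfg[b]["defs"] - args[b] for b in cfg}
    return args, {b: m for b, m in missing.items() if m}


# The listing order stands in for the topological order mentioned above.
ORDER = ["bb0", "bb1", "bb2", "bb3", "bb4", "bb5", "bb6"]
args, missing = single_pass(CFG, ORDER)
for block in sorted(missing):
    print(block, "still reads", sorted(missing[block]))
```

In this model, bb1 is left reading %2 (the constant 0) without a matching block argument, and bb2 and bb3 have the same problem for values forwarded to blocks visited after them. The number of added arguments per block (1, 1, 1, 7 and 3 for bb1 to bb5) agrees with the listing above, which suggests the model captures the issue, although it says nothing about how the real pass is written.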

Therefore, this PR:

Fixes the bug that prevented the --canonicalize-live-in pass from handling such a case.
Handles arbitrary control flow, although it may introduce meaningless or redundant phi-grant_predicate-ctrl_mov chains.
These redundancies can be removed by --fuse-control-flow in the future. Because they are generated by our canonicalization, they have regular dependencies and are easy to remove.
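One way to picture live-in propagation that works for arbitrary control flow is a fixed-point iteration over the CFG. The Python sketch below is an assumption about the general technique (classic backward liveness), not the C++ logic in CanonicalizeLiveInPass.cpp; the CFG table is transcribed by hand from the input IR, and propagate_live_ins is a hypothetical name.

```python
# Hand-transcribed model of the input IR: for each block, the values it
# defines (results and original block arguments), the values it reads,
# and its successors. This table is an illustration, not pass output.
CFG = {
    "bb0": {"defs": {"%arg0", "%arg1", "%0", "%1", "%2", "%3"},
            "uses": {"%2", "%3"}, "succs": ["bb1"]},
    "bb1": {"defs": {"%4", "%5", "%6"},
            "uses": {"%4", "%1", "%5", "%6"}, "succs": ["bb2", "bb6"]},
    "bb2": {"defs": {"%7"}, "uses": {"%2", "%7"}, "succs": ["bb3"]},
    "bb3": {"defs": {"%8", "%9", "%10"},
            "uses": {"%8", "%1", "%9", "%10"}, "succs": ["bb4", "bb5"]},
    "bb4": {"defs": {"%11", "%12", "%13"},
            "uses": {"%arg0", "%arg1", "%0", "%2", "%5", "%9",
                     "%11", "%12", "%13"}, "succs": ["bb3"]},
    "bb5": {"defs": {"%14", "%15"},
            "uses": {"%0", "%5", "%14", "%15"}, "succs": ["bb1"]},
    "bb6": {"defs": set(), "uses": set(), "succs": []},
}


def propagate_live_ins(cfg):
    """Iterate until no block's live-in set changes (handles loops)."""
    live_in = {b: set() for b in cfg}
    changed = True
    while changed:
        changed = False
        for b, info in cfg.items():
            # A block needs what it reads itself plus whatever it must
            # forward to its successors, minus what it defines locally.
            needed = set(info["uses"])
            for s in info["succs"]:
                needed |= live_in[s]
            needed -= info["defs"]
            if needed != live_in[b]:
                live_in[b] = needed
                changed = True
    return live_in


live_in = propagate_live_ins(CFG)
for block in sorted(live_in):
    print(block, sorted(live_in[block]))
```

Because the iteration repeats until nothing changes, a live-in discovered in bb2 also reaches bb1, and the back edges from bb4 and bb5 are covered as well. On this model bb1 gets five live-ins (%0, %1, %2, %arg0, %arg1) on top of its original argument; four of them are only forwarded to successors and never read in bb1 itself, which is the kind of pass-through value that can lead to redundant chains later.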

The new canonicalized IR looks like:

  func.func @_Z10bert_node1PA1_A1_A1_A1_A128_bPA1_A128_S1_(%arg0: memref<?x1x1x1x1x128xi8>, %arg1: memref<?x1x128x1x1x128xi8>) attributes {accelerator = "neura", llvm.linkage = #llvm.linkage<external>} {
    %0 = "neura.constant"() <{predicate = true, value = "%arg0"}> : () -> memref<?x1x1x1x1x128xi8>
    %1 = "neura.constant"() <{predicate = true, value = "%arg1"}> : () -> memref<?x1x128x1x1x128xi8>
    %2 = "neura.constant"() <{value = 1 : i64}> : () -> i64
    %3 = "neura.constant"() <{value = 128 : i64}> : () -> i64
    %4 = "neura.constant"() <{value = 0 : i64}> : () -> i64
    neura.br %4, %3, %4, %0, %1, %2 : i64, i64, i64, memref<?x1x1x1x1x128xi8>, memref<?x1x128x1x1x128xi8>, i64 to ^bb1
  ^bb1(%5: i64, %6: i64, %7: i64, %8: memref<?x1x1x1x1x128xi8>, %9: memref<?x1x128x1x1x128xi8>, %10: i64):  // 2 preds: ^bb0, ^bb5
    %11 = "neura.icmp"(%5, %6) <{cmpType = "slt"}> : (i64, i64) -> i1
    neura.cond_br %11 : i1 then %7, %6, %8, %9, %5, %10 : i64, i64, memref<?x1x1x1x1x128xi8>, memref<?x1x128x1x1x128xi8>, i64, i64 to ^bb2 else to ^bb6
  ^bb2(%12: i64, %13: i64, %14: memref<?x1x1x1x1x128xi8>, %15: memref<?x1x128x1x1x128xi8>, %16: i64, %17: i64):  // pred: ^bb1
    neura.br %12, %13, %14, %12, %15, %16, %17 : i64, i64, memref<?x1x1x1x1x128xi8>, i64, memref<?x1x128x1x1x128xi8>, i64, i64 to ^bb3
  ^bb3(%18: i64, %19: i64, %20: memref<?x1x1x1x1x128xi8>, %21: i64, %22: memref<?x1x128x1x1x128xi8>, %23: i64, %24: i64):  // 2 preds: ^bb2, ^bb4
    %25 = "neura.icmp"(%18, %19) <{cmpType = "slt"}> : (i64, i64) -> i1
    neura.cond_br %25 : i1 then %20, %21, %18, %22, %23, %24, %19 : memref<?x1x1x1x1x128xi8>, i64, i64, memref<?x1x128x1x1x128xi8>, i64, i64, i64 to ^bb4 else %23, %24, %19, %21, %20, %22 : i64, i64, i64, i64, memref<?x1x1x1x1x128xi8>, memref<?x1x128x1x1x128xi8> to ^bb5
  ^bb4(%26: memref<?x1x1x1x1x128xi8>, %27: i64, %28: i64, %29: memref<?x1x128x1x1x128xi8>, %30: i64, %31: i64, %32: i64):  // pred: ^bb3
    %33 = neura.load_indexed %26[%27, %27, %27, %27, %27, %28 : i64, i64, i64, i64, i64, i64] memref<?x1x1x1x1x128xi8> : i8
    neura.store_indexed %33 to %29[%27, %27, %30, %27, %27, %28 : i64, i64, i64, i64, i64, i64] memref<?x1x128x1x1x128xi8> : i8
    %34 = "neura.add"(%28, %31) : (i64, i64) -> i64
    neura.br %34, %32, %26, %27, %29, %30, %31 : i64, i64, memref<?x1x1x1x1x128xi8>, i64, memref<?x1x128x1x1x128xi8>, i64, i64 to ^bb3
  ^bb5(%35: i64, %36: i64, %37: i64, %38: i64, %39: memref<?x1x1x1x1x128xi8>, %40: memref<?x1x128x1x1x128xi8>):  // pred: ^bb3
    %41 = "neura.add"(%35, %36) : (i64, i64) -> i64
    neura.br %41, %37, %38, %39, %40, %36 : i64, i64, i64, memref<?x1x1x1x1x128xi8>, memref<?x1x128x1x1x128xi8>, i64 to ^bb1
  ^bb6:  // pred: ^bb1
    "neura.return"() : () -> ()
  }
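The difference between the two canonicalized listings above can be checked mechanically: after canonicalization, every block other than the entry should read only its own block arguments and results. The Python sketch below does this on hand-transcribed tables; OLD_IR, NEW_IR and foreign_uses are hypothetical names, and the tables rely on the observation that SSA numbers are contiguous per block in both listings.

```python
# block -> (SSA numbers owned by the block: its arguments and results,
#           SSA numbers read by the block's operations).
# Transcribed by hand from the two listings; not produced by any tool.
OLD_IR = {
    "bb0": (range(0, 5), [4, 3]),
    "bb1": (range(5, 8), [5, 6, 7, 4]),
    "bb2": (range(8, 9), [8, 3]),
    "bb3": (range(9, 12), [9, 10, 11, 0, 4, 1, 5, 2, 3]),
    "bb4": (range(12, 21), [12, 13, 14, 15, 16, 17, 18, 19, 20]),
    "bb5": (range(21, 25), [21, 22, 23, 24]),
}
NEW_IR = {
    "bb0": (range(0, 5), [4, 3, 0, 1, 2]),
    "bb1": (range(5, 12), [5, 6, 11, 7, 8, 9, 10]),
    "bb2": (range(12, 18), [12, 13, 14, 15, 16, 17]),
    "bb3": (range(18, 26), [18, 19, 25, 20, 21, 22, 23, 24]),
    "bb4": (range(26, 35), [26, 27, 28, 33, 29, 30, 31, 34, 32]),
    "bb5": (range(35, 42), [35, 36, 41, 37, 38, 39, 40]),
}


def foreign_uses(ir):
    """Return, per block, the SSA numbers it reads but does not own."""
    result = {}
    for block, (owned, uses) in ir.items():
        foreign = sorted({u for u in uses if u not in owned})
        if foreign:
            result[block] = foreign
    return result


old_violations = foreign_uses(OLD_IR)
new_violations = foreign_uses(NEW_IR)
print("old:", old_violations)
print("new:", new_violations)
```

On these tables the earlier listing has foreign reads in bb1, bb2 and bb3, while the new listing has none.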

lib/NeuraDialect/Transforms/CanonicalizeLiveInPass.cpp

fix the bug in canonicalize-live-in

6f94e58

ShangkunLi marked this pull request as ready for review August 20, 2025 12:48

tancheng approved these changes Aug 20, 2025

tancheng requested review from HobbitQia and MeowMJ August 20, 2025 13:43

tancheng assigned ShangkunLi Aug 20, 2025

tancheng added the bug and enhancement labels Aug 20, 2025

ShangkunLi added 2 commits August 20, 2025 22:56

[fix] fix some type

84de140

[fix] rename the block -> succ/pred_block

9c26231

tancheng reviewed Aug 20, 2025

lib/NeuraDialect/Transforms/CanonicalizeLiveInPass.cpp (review thread outdated, resolved)

tancheng reviewed Aug 20, 2025

lib/NeuraDialect/Transforms/CanonicalizeLiveInPass.cpp (review thread resolved)

ShangkunLi added 2 commits August 21, 2025 00:25

[fix] update comments

9a04427

[fix] update comments & variable name

4490e70

tancheng reviewed Aug 20, 2025

lib/NeuraDialect/Transforms/CanonicalizeLiveInPass.cpp (review thread resolved)

refactor the logic of live-in propogation

613320e

tancheng approved these changes Aug 21, 2025

tancheng merged commit 87342f8 into coredac:main Aug 21, 2025
1 check passed

ShangkunLi mentioned this pull request Aug 22, 2025

[P0] Github verification fails after checking in https://github.com/coredac/dataflow/pull/105 #118

Closed

ShangkunLi linked an issue Aug 25, 2025 that may be closed by this pull request

[P1] Transform Ctrl to Data Flow Error #113

Closed

ShangkunLi mentioned this pull request Aug 25, 2025

[P1] Transform Ctrl to Data Flow Error #113

Closed

ShangkunLi pushed a commit that referenced this pull request Mar 12, 2026

Merge pull request #117 from ShangkunLi/fix-canonicalize

94f2c91

Fix the bug in canonicalize-live-in pass

ShangkunLi pushed a commit that referenced this pull request Mar 12, 2026

Merge pull request #117 from ShangkunLi/fix-canonicalize

e47eed6

Fix the bug in canonicalize-live-in pass

tancheng merged 6 commits into coredac:main from ShangkunLi:fix-canonicalize

2 participants
